(54) Title: ANNOUNCED SESSION DESCRIPTION
(57) Abstract 

The invention provides a method of announcing a description 
of a media session, for example a multimedia conference. In one 
respect, the invention provides a modular method of announcing 
media sessions. This method comprises the steps of generating 
a first base module (410) having a first data structure comprising 
user oriented data relevant to the media session; generating at least 
one media module (421, 422, 423) having a second data structure 
comprising media oriented data necessary for a user to receive a 
respective media stream of the media session; providing a link 
between the first base module and the at least one media module; 
and, announcing the media session by making at least the first base 
module available to potential recipients of the media session, wherein 
the link between the first base module and the at least one media 
module permits a user to access the at least one media module and 
subsequently receive the media stream. 

WO 00/36804 PCT/GB99/03871 
ANNOUNCED SESSION DESCRIPTION 

The present invention relates to the announcement of media stream connections 
for a media session over a communications network. 

Multicast transmissions are becoming increasingly common on the Internet. In 
contrast to standard Internet Protocol (IP) point to point transmissions (unicast), IP 
multicast allows the simultaneous transmission of information to a group of recipients 
from a single source. Routing support for IP multicast transmissions is provided by the 
MBone (IP Multicast Backbone) which is a virtual network layered on top of the 
Internet. 

IP multicast allows real-time communications over wide area IP networks and 
typical transmissions include video and audio conferencing, live multimedia training, 
university lectures and transmission of live television programmes. 

A multicast transmission usually consists of a multimedia session made up of 
several individual media streams typically carrying video, audio, whiteboard or raw data. 
Some sessions are persistent, but the majority exist for a specific period of time, 
although that period need not be continuous. Multicast based transmissions on the 
MBone differ from unicast IP transmissions in that any user able to receive the 
transmission can join the session (unless the transmission is encrypted). To receive a 
transmission, a user need only know the appropriate transmission address and timing 
information. 
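
By way of illustration only, joining a multicast group from an end user's terminal can be sketched in Python using the standard socket interface. The function names membership_request and open_receiver are chosen here for illustration and do not appear in the specification; the group address and port passed to them would be taken from a session description.

```python
import ipaddress
import socket


def membership_request(group, interface="0.0.0.0"):
    """Build the 8-byte ip_mreq value: group address followed by local interface."""
    if not ipaddress.ip_address(group).is_multicast:
        raise ValueError(f"{group} is not a multicast address")
    return socket.inet_aton(group) + socket.inet_aton(interface)


def open_receiver(group, port):
    """Open a UDP socket and ask the host to join the multicast group."""
    sock = socket.socket(socket.AF_INET, socket.SOCK_DGRAM, socket.IPPROTO_UDP)
    sock.setsockopt(socket.SOL_SOCKET, socket.SO_REUSEADDR, 1)
    sock.bind(("", port))
    sock.setsockopt(socket.IPPROTO_IP, socket.IP_ADD_MEMBERSHIP,
                    membership_request(group))
    return sock
```

Calling open_receiver with the address and port of a stream returns a socket from which datagrams of that stream can be read once the transmission starts; no per-receiver state is needed at the source.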

Prior to a multicast transmission an appropriate announcement containing a 
session description is made, usually at an IP group multicast address. Standard session 
descriptions are generated using a Session Description Protocol (SDP), as defined in the 
Internet Engineering Task Force's draft RFC 2327. SDP is a simple ASCII text based 
protocol that is used to describe real time multimedia sessions and their related 
scheduling information. SDP messages are wrapped in a carrier protocol, known as a 
Session Announcement Protocol (SAP), which, in addition to containing the necessary 
IP addressing and routing information for transmission across the Internet or MBone, 
allows the SDP message to be encrypted, signed or compressed. An announcement can 
then be sent at regular intervals to the announcement group address. As an alternative 
to SAP, a session may be announced by placing an SDP message on a World Wide Web 
site (WWW) or by sending it to individuals by email or as a unicast transmission inviting 
them to participate. 

An SDP message conveys information about each media stream in the multicast 
multimedia session to allow the recipients to participate in the session. A typical SDP 
message will include the session name and purpose, the time(s) and date(s) the session 
will be active, the component media streams of the session and information required to 
participate in each media stream (IP multicast address, port, media format). The SDP 
message may also include details of the session's bandwidth requirements, an 
encryption key necessary to participate in a secure multicast transmission using public 
key encryption, contact information for the organiser of the multicast session, and a 
Uniform Resource Identifier (URI) pointing to a WWW or an Intranet web site where 
further information on the session may be found, for example, background information 
relating to the conference. 
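
For illustration, the flat layout of a conventional SDP message can be seen in the following Python sketch, which splits a message into session-level fields and per-stream media sections. The sample message, its addresses and the function name parse_sdp are invented for this example; only the single-letter "type=value" line layout, and the convention that each "m=" line opens a media section, are taken from RFC 2327.

```python
def parse_sdp(text):
    """Split an SDP message into session-level fields and media sections."""
    session, media = [], []
    current = session
    for line in text.strip().splitlines():
        line = line.strip()
        if len(line) < 2 or line[1] != "=":
            continue  # skip anything that is not a "<type>=<value>" line
        key, value = line[0], line[2:]
        if key == "m":
            # each m= line starts a new media section
            current = []
            media.append(current)
        current.append((key, value))
    return session, media


# Hypothetical message for a single video stream (all values are assumptions).
SAMPLE = """\
v=0
o=asender 2890844526 2890842807 IN IP4 192.0.2.10
s=Live multicast television session
t=0 0
m=video 7000 RTP/AVP 31
c=IN IP4 229.1.1.2/127
"""

session, media = parse_sdp(SAMPLE)
print(dict(session)["s"])
print(media[0])
```

The result is a single list of session fields followed by a linear list of media sections, which is the structure whose limitations are discussed below.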

The level of participation a user may make in a session or stream depends on its 
purpose. In a multicast television session, typically users would only be able to receive 
the session streams, whilst in a multicast conference session the communication would 
be bi-directional, with a central server (such as group address 120) receiving each 
participant's transmissions and relaying them to the other participants. The level of 
participation expected of a user in a session or stream may be explicitly stated in the 
session description or it may be implicit in the session description, for example 
when a receive-only application is associated with a media stream type in the session 
description. 




A common front end interface used by multicast end users is known as Session 
Directory Rendezvous (SDR). This interface takes the received announcements, decodes 
the SDP message and displays the names of those sessions that are still current in a 
list. The end user may then select one of the listed announcements to view further 
technical and user-oriented details of the announced session. From the displayed 
information, the end user can then select to join individual streams of the session or to 
join the entire session. Once the streams to be joined are selected, SDR starts the 
necessary multicast-enabled multimedia application on the end user's computer, such 
as Vic and Vat, and passes the relevant stream information (a transport port address) 
from the announcement onto the application allowing the application to establish the 
link to the associated IP multicast address and participate in the stream at transmission 
time. Having initiated the applications and passed on the relevant transport port address, 
SDR plays no further part in the session. 

Recent increased usage and demand for (multi)media sessions has highlighted a 
number of limitations in SDP. SDP limits session descriptions to defining a session 
having a single set of timings that apply to all of the streams within it. A session in 
which a stream starts mid-way through the transmission cannot easily be described 
using SDP. The structure of a session description written in SDP must be a simple linear 
list of streams which may not reflect the intuitive structure of a complex session. SDP 
supports a limited and predefined set of applications which can receive the streams and 
a limited and predefined set of transport mechanisms (e.g. Simple layering, RTP and 
UDP). As guaranteed Quality of Service (QoS) becomes more and more desirable to 
the consumer and the supplier, there is also a need to define QoS policies for the entire 
session and for individual streams in terms of required system resources, bandwidth 
requirements and supported applications. There may also be requirements on 
the prioritisation of streams and subsessions or more complicated rules about receiving 
streams. A further requirement on the part of the supplier will be the need for charging 
facilities permitting an end user to be charged for a multicast transmission to which 
they subscribe according to the QoS and types of streams received etc. There is little 
scope to include information about QoS policies or charging, or any metadata about the 
session, within the conventional structure of an SDP session description. 

According to a first aspect of the present invention there is provided a method 
of announcing a description of a media session, comprising the steps of: 

generating a first base module having a first data structure comprising user 
oriented data relevant to the media session; 

generating at least one media module having a second data structure comprising 
media oriented data necessary for a user to receive a respective media stream of the 
media session; 

providing a link between the first base module and the at least one media 
module; and, 

announcing the media session by making at least the first base module available 
to potential recipients of the media session, 

wherein the link between the first base module and the at least one media 
module permits a user to access the at least one media module and subsequently 
receive the media stream. 

The present invention provides a modular description system for a media session 
in which session descriptions are constructed in a hierarchical manner providing a 
plurality of levels of information concerning the constituent parts of the described 
session. 
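
One possible in-memory representation of such a modular description is sketched below in Python. The class name Module, the helper link and the identifiers 10 and 20 are illustrative assumptions and are not taken from the specification; the sketch shows only a base module and a media module joined by a two-way link.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class Module:
    ident: int
    kind: str                                  # "base", "media" or an options type
    fields: Dict[str, str] = field(default_factory=dict)
    parent: Optional[int] = None               # upward link to the owning module
    children: List[int] = field(default_factory=list)  # downward links


def link(parent: Module, child: Module) -> None:
    """Record the link in both directions."""
    parent.children.append(child.ident)
    child.parent = parent.ident


# Arbitrary identifiers chosen for the example.
base = Module(10, "base", {"title": "live multicast television session"})
video = Module(20, "media", {"connection": "229.1.1.2/7000"})
link(base, video)
print(base.children, video.parent)
```

Because the link is held by identifier rather than by containment, the base module can be announced on its own and the media module obtained later.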

A problem faced with the current distribution of announcements from the single 
announcement group address is that there is a limit to the size of each announcement 
and the frequency with which each can be sent out. In the present invention, it is 
possible to provide a modular description system in which a distributed announcement 
contains links available to the end user to other portions of the announcement which 
have not been transmitted. 

Preferably, the method further comprises the steps of: generating a second base 
module, the second base module containing user oriented data relating to a 
sub-session of the media session; linking the second base module to the first base module; 
and, linking said at least one media module to the second base module. 

In preferred embodiments, the method further comprises the steps of: generating 
at least one options module having a third data structure comprising data relating to 
service level criteria required to participate in the media session; and, linking the or each 
options module to a respective base module. 

The data contained in the options module may relate to a quality of service 
policy to be used by the media session or a part thereof. Alternatively, the data 
contained in the options module may relate to a security system to be used by the 
media session or a part thereof. The data contained in the options module may further 
relate to a charging system to be used by the media session or a part thereof. 

In preferred embodiments, one or more media module(s) comprise data 
necessary for a user to receive a layered media stream of a respective media session; 
and said method further comprises the step of linking the or each media module to one 
or more respective options module(s) containing data relating to a layered mechanism of 
the respective layered media stream necessary for a party to participate in the layered 
media stream. 

The media session may be announced by transmitting all of the constituent modules of 
the session description. Alternatively, the media session may be announced by 
transmitting only some of the constituent modules of the session description, with the 
remaining modules of the session description being subsequently accessible by a user 
using one or more links provided in the modules transmitted. The remaining modules of 
the session description may be held on one or more servers, in which case the one or 
more links to the remaining modules may be in the form of URI pointers. Modules of the 
session description may contain links to modules which are generated subsequent to the 
announcement. 

According to a second aspect of the invention there is provided a computer 
readable storage medium containing data defining at least a part of a description of a 
media session, the session description comprising:- 
a first base module having a first data structure comprising user oriented data 
relevant to the media session; 

at least one media module having a second data structure comprising media 
oriented data necessary for a user to receive a respective media stream of the media 
session; and, 

a link between the first base module and the at least one media module. 

Another problem faced by providers of current (multi)media sessions and the 
developers of the associated (multi)media applications is the spread of skills required to 
implement an application that can initiate and manage a real-time data connection over 
a communications network and perform the (multi)media functions the end user would 
expect. For example, developers of multimedia applications require teams with skills in 
audio and video coding, network transport protocols, real time programming, user 
interface design and integration techniques. The session description of the present 
invention simplifies this process by allowing the necessary communication channels and 
media streams to be identified in the session description. This information is used by 
generic middleware in the form of a session control and communications manager to 
dynamically instantiate the respective streams and channels for the applications at run 
time. 

Furthermore, until now the only way a QoS policy could be implemented was to 
process a session description to determine which streams of a session could or should 
be run and then to initialise the applications so they connect to the respective streams. 
This required the communications manager not only to know about the session 
requirements and available system resources but also the capabilities of each 
application. 

In a preferred example of the present invention the media modules of a session 
description are checked by the respective multimedia client application prior to QoS 
management, thereby reducing the workload of the communications manager, that is to 
say the respective client applications determine whether the media modules can be 
supported. Furthermore, applications need only request streams from the session 
control system associated with the client since the session control now handles 
centrally the creation and management of streams in real time. This aspect is also the 
subject of our co-pending UK patent application 9826157.1. 

The present invention simplifies application development and service provision. 
A further problem is that applications should be able to adapt to available network and 
host resources. This is particularly important for multi-party applications operating in 
heterogeneous environments where each party may have different resources available 
to them. Furthermore the nature of the heterogeneity may vary over the lifetime of the 
session, for example as network congestion varies or as the terminal resources are 
shared with other applications or other users. The present invention is able to use a 
QoS policy incorporated within the session description to prioritise the allocation of 
resources and to determine whether participation in the session is viable. 

A still further problem is that the application developer and service provider 
typically need to address security and charging requirements. The present invention 
allows security and charging policies to be incorporated within the session description 
for use within the session control system to invoke appropriate charging and security 
procedures. Instead of having to develop security and charging functions the application 
developer and service provider need only specify appropriate policies. 

In the present invention application development is simplified by using the 
session description to drive the dynamic management of communication channels and 
to adapt to available resources. It also reduces the problem of handling charging and 
security requirements to a matter of specifying charging and security policies within the 
session description. 

An example of the present invention will now be described in detail with 
reference to the accompanying drawings, in which: 

Figure 1 is a schematic diagram illustrating a multicast transmission across the 
MBone; 

Figure 2 is a schematic diagram illustrating the distribution of an SDP 
announcement; 

Figure 3 is a block diagram of a modular session description of a simple session 
generated in accordance with the present invention; 

Figure 4 is a block diagram of a modular session description of a complex 
session generated in accordance with the present invention; 

Figure 5 is a schematic diagram of a system for managing media stream 
connections; 

Figure 6 is a flow chart illustrating the steps involved in managing a media 
session according to the system of Figure 5; and, 

Figure 7 is a flow chart further illustrating a parsing step of Figure 6. 

An example of an IP multicast transmission system is described with reference 
to Figure 1. Prior to a multicast transmission, an appropriate announcement containing 
a session description is made, thereby allowing end users 110a-110e to elect to receive 
the transmission. Each end user electing to receive the transmission is linked to a group 
IP Multicast address 120 associated with the transmission. At the transmission time of 
the multicast session, the session streams are transmitted from a source 130, or a 
plurality of sources, to the group address. At the group address, the transmission is 
disseminated along the links 140 to those end users who have elected to receive it (in 
this example end users 110a-110c). 

An example of an announcement and election system is described with 
reference to Figure 2. Most public multicast sessions are announced at a single group IP 
multicast address 200 dedicated to the transmission of announcements to multicast 
end users. End users 210a-210e electing to receive the announcements are linked to 
the announcement group address and, in the same way as an actual session 
transmission, each announcement arriving at the announcement group address is 
disseminated to the end users. A front end interface 220 on each end user's computer 
displays information obtained from the associated session description for each 
announcement. The minimum information a session description may contain is a time 
and date that the session will be active and the group IP multicast address(es) from 
which the end user may elect to receive one or more media streams and to which they 
could send their own streams for the session. Using the front end interface, an end user 
can select the announced session(s), or their component stream(s) they wish to 
participate in. 

Figure 3 is a block diagram of a session description 300 for a simple multicast 
television session. The session description 300 comprises a base module 310 linked to 
a media module 320. 

The base module 310 contains user oriented data relating to the session 
including the title and timing information. The base module 310 may also include a 
description or abstract, contact information about the organiser and a WWW or an 
intranet URI pointing to a web site containing further information. Ideally, the base 
module 310 should contain enough information for the user to decide if they are 
interested in participating in the session. 

The media module 320 contains announcement data relating to a video stream 
of the session. The media module 320 contains the technical information (data) 
necessary for the user to receive the associated media stream. In particular, 
connection, timing and media format details are provided. 

A first example of a session description 300 generated for transmission to end 
users is shown below: 



( 
type=(base) 
id=(310) 
info=(title="live multicast television session") 
source=(name="A. Sender" email=asender@tx.com) 
media=(video=(client=odbits0.16)) 
time=(length=50m repeat=continuous) 
category=("Entertainment") 
options=(none) 
modules=(m=320) 
) 

( 
type=(media) 
id=(320 310) 
media=(video=(client=odbits0.16)) 
connection=(229.1.1.2/7000) 
time=(length=50m) 
) 



Session description example 1 



The base module 310 has a unique identifier (id field) used in the generation of 
links between two modules during the processing of the session description. The 
modules field of the base module 310 lists the type and unique identifier of the media 
module 320 linked to the base module 310. The second identifier in the id field of the 
media module 320 is the unique identifier belonging to the base module 310 linking the 
media module back to the base module 310. By extension, these two-way links permit 
a module tree to be traversed from a base module downwards or from a media module 
upwards. The use of this feature is described later with reference to session description 
example 4. 
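
The traversal made possible by the two-way links can be sketched as follows in Python. The module identifiers 1 to 5 and the function names are hypothetical; each module is reduced here to its own identifier, the identifier of the module above it (the second entry of its id field) and the identifiers listed in its modules field.

```python
def build_index(modules):
    """modules: iterable of (ident, parent_or_None, child_ids) tuples."""
    parent_of = {ident: parent for ident, parent, _ in modules}
    children_of = {ident: list(children) for ident, _, children in modules}
    return parent_of, children_of


def path_to_root(ident, parent_of):
    """Follow the upward links from a module to the top level base module."""
    path = [ident]
    while parent_of[path[-1]] is not None:
        path.append(parent_of[path[-1]])
    return path


def descendants(ident, children_of):
    """Follow the downward links, returning identifiers in pre-order."""
    found = []
    stack = list(reversed(children_of.get(ident, [])))
    while stack:
        node = stack.pop()
        found.append(node)
        stack.extend(reversed(children_of.get(node, [])))
    return found


# Hypothetical tree: base module 1 above base modules 2 and 3; 2 above media 4 and 5.
MODULES = [(1, None, [2, 3]), (2, 1, [4, 5]), (3, 1, []), (4, 2, []), (5, 2, [])]
parent_of, children_of = build_index(MODULES)
print(path_to_root(5, parent_of))
print(descendants(1, children_of))
```

The upward walk lets a component that holds only a media module find the base modules, and hence the options, that apply to it.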

The connection field of the media module 320 contains the IP multicast address 
and port number from which the media stream can be received. 
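
A minimal parser for the parenthesised notation used in the example above might look like the following Python sketch. It is an illustration only: the specification does not define a grammar, so the token rules, the treatment of "#" as a comment marker and the function names are assumptions, and malformed input is not handled. The fragment parsed is a media module whose connection address is the reading of the printed example assumed here.

```python
import re

# A quoted string, a single parenthesis or "=", or a bare word.
TOKEN = re.compile(r'"[^"]*"|[()=]|[^\s()="]+')


def tokenize(text):
    # Drop "#" comments, then split into parentheses, "=", strings and words.
    lines = [line.split("#", 1)[0] for line in text.splitlines()]
    return TOKEN.findall("\n".join(lines))


def parse_group(tokens, pos):
    """Parse items up to the matching ')'; return (items, next position)."""
    items = []
    while tokens[pos] != ")":
        word = tokens[pos].strip('"')
        if tokens[pos + 1] == "=":
            if tokens[pos + 2] == "(":
                value, pos = parse_group(tokens, pos + 3)
            else:
                value, pos = tokens[pos + 2].strip('"'), pos + 3
            items.append((word, value))
        else:
            items.append(word)
            pos += 1
    return items, pos + 1


def parse_modules(text):
    """Return one dict per top-level '( ... )' module."""
    tokens, pos, modules = tokenize(text), 0, []
    while pos < len(tokens):
        if tokens[pos] != "(":
            raise ValueError("expected '(' at start of module")
        items, pos = parse_group(tokens, pos + 1)
        modules.append(dict(items))
    return modules


EXAMPLE = """
( # media module
type=(media)
id=(320 310)
connection=(229.1.1.2/7000)
time=(length=50m)
)
"""

module = parse_modules(EXAMPLE)[0]
print(module["id"], module["connection"])
```

Each field value is returned as a list, so the id field yields the module's own identifier followed by that of the module above it.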

Figure 4 is a block diagram of a session description 400 for a complex multicast 
session of a multimedia conference with two tracks, or sub-sessions, and a panel 
discussion. Each track provides multiparty video and audio conferencing and a shared 
whiteboard for leaving notes and messages. The panel discussion is encrypted and the 
whole conference is subject to a subscription fee payable in advance by each 
participant. 

The session description 400 contains a top level base module 410 linked to 
further base modules 420, 430, 440 and an options module 411. The top level base 
module 410 contains data relating to the overall session including its name, purpose 
and timing information. The options module 411 contains details of the payment 
mechanism for subscription fees. 

Each further base module 420, 430, 440 relates to a subsession of the 
conference. Base module 420 relates to the first track of the conference. The base 
module 420 is linked to media modules 421-423, each containing connection, timing 
and media format data for respective video, audio and whiteboard streams. 

The base module 420 is also linked to options module 424 which contains data 
relating to a QoS policy for the first track defining which media modules are optional 
and which are mandatory for a participant of the first track. The mandatory list contains 
identifiers of those media modules which are needed for the session or subsession to 
operate correctly whilst the optional list contains identifiers of the media modules that 
are not necessary for the session or subsession to operate correctly if system resources 
are scarce. 
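
The effect of such mandatory and optional lists can be sketched in Python as follows. The specification does not state how resources are measured, so the per-stream costs, the single numeric budget and the function name select_streams are assumptions made for the purpose of the example.

```python
def select_streams(mandatory, optional, cost, budget):
    """Return the media module ids to connect, or None if not viable."""
    needed = sum(cost[m] for m in mandatory)
    if needed > budget:
        return None                 # a mandatory stream cannot be supported
    chosen, remaining = list(mandatory), budget - needed
    for m in optional:              # optional streams are added while resources last
        if cost[m] <= remaining:
            chosen.append(m)
            remaining -= cost[m]
    return chosen


# Hypothetical costs (for example in kbit/s) for media modules 421 to 423.
COST = {421: 300, 422: 64, 423: 50}
print(select_streams([421, 422], [423], COST, 400))
print(select_streams([421, 422], [423], COST, 500))
print(select_streams([421, 422], [423], COST, 200))
```

With a budget of 400 only the mandatory streams are selected, with 500 the optional whiteboard stream is added, and with 200 participation is reported as not viable.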

The base module 430 relates to the second track of the conference. It is linked 
to media modules 431-433, each containing connection, timing and media format 
details for respective video, audio and whiteboard streams. The base module 430 is 
also linked to options module 434 which contains data relating to a QoS policy for the 
second track defining which media modules are optional and which are mandatory for a 
participant of the second track. Base module 440 relates to the panel discussion. It is 
linked to media modules 441 and 442, each containing connection, timing and media 
format details for respective video and audio streams of the panel discussion. The base 
module 440 is also linked to options module 443 which contains encryption details (i.e. 
how and where to get the necessary cryptographic keys) necessary for a participant to 
decode the panel discussion media streams 441, 442 according to a known encryption 
mechanism such as DES or public key encryption. 

The video media stream defined in media module 441 is layered. Layering of 
media streams allows users with different system resources to receive as much of the 
stream as their system resources allow. Every user must receive the bottom layer of 
the stream containing the minimum stream data. However, if a user has sufficient free 
system resources they can receive the next layer up containing enhancements to the 
previous layer. Successive layers can be received, enhancing the received media stream, 
until the maximum number of layers is received or all free system resource capacity is 
used. The media module 441 is linked to an options module 444 which contains data 
on the layering necessary for the end user to be able to receive the layered stream 
correctly. 
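
The layer selection described above can be illustrated with the following Python sketch. The per-layer costs and the function name layers_to_receive are assumptions; the sketch merely counts how many successive layers, starting from the bottom layer, fit within the free capacity.

```python
def layers_to_receive(layer_costs, available):
    """layer_costs[0] is the bottom layer; return how many layers to receive."""
    count, used = 0, 0
    for cost in layer_costs:
        if used + cost > available:
            break                   # the next enhancement layer does not fit
        used += cost
        count += 1
    return count                    # 0 means even the bottom layer is too costly


# Hypothetical three-layer stream: a bottom layer and two enhancement layers.
print(layers_to_receive([128, 64, 64], 200))
```

A result of zero indicates that the stream cannot be received at all, since the bottom layer is required by every user.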
The portion of the session description 400 generated for modules 410, 411, 420 
and 440 for transmission to end users is shown below in session description example 
2. 



( # overall conference session 
type=(base) 
id=(410) 
info=(title="Multimedia98 Conference") 
source=(owner="Joe Bloggs" email=joe@nowhere.com) 
media=(video=(client=RealPlayerG2) whiteboard=(client=wb)) 
time=(start="09:00 GMT 25/12/98" stop="13:00 GMT 25/12/98") 
options=(oc=411) 
modules=(b=420 b=430 b=440 oc=411) 
) 

( # conference track 1 
type=(base) 
id=(420 410) 
info=(title="MM98 Systems and Applications Track") 
source=(owner="Joe Bloggs" email=joe@nowhere.com) 
media=(video=(client=RealPlayerG2) whiteboard=(client=wb)) 
time=(start="09:00 GMT 25/12/98" stop="11:00 GMT 25/12/98") 
options=(osq=424) 
modules=(m=421 m=422 m=423 osq=424) 
) 

( # session QoS for track 1 
type=(option-sQoS) 
id=(424 420) 
mandatory=(421 422) 
optional=(423) 
) 

( # conference panel discussion 
type=(base) 
id=(440 410) 
info=(title="MM98 Panel Discussion") 
source=(name="Joe Bloggs" email=joe@nowhere.com) 
media=(video=(client=RealPlayerG2) whiteboard=(client=wb)) 
time=(start="11:00 GMT 25/12/98" stop="13:00 GMT 25/12/98") 
options=(osec=443) 
modules=(m=441 m=442 osec=443) 
) 

( # video for panel discussion 
type=(media) 
id=(441 440) 
info=(title="MM98 Panel Discussion Video") 
source=(owner="Joe Bloggs" email=joe@nowhere.com) 
media=(video=(type=live client=RealPlayerG2)) 
connection=(226.0.0.106/1010 policy=444) 
time=(start="11:00 GMT 25/12/98" stop="13:00 GMT 25/12/98") 
) 

( # media QoS policy for panel discussion video 
type=(option-mQoS) 
id=(444 440) 
mechanism=(layer=(base=226.0.0.106/1010 number=3)) 
) 

( # encryption policy for panel discussion 
type=(option-sec) 
id=(443 440) 

participant=(member=w3c) 

publickey=(location=http://www.w3.org/members_only/) 
info=(location=http://www.w3.org/) 

) 

( # charging policy for entire conference 
type=(option-chg) 
id=(411 410) 
mechanism=(type=AAA) 
price=(fee=1000GBP) 
info=(location=http://www.aaa.net/) 

) 

Session description example 2 



Where there is surplus network bandwidth available, complete session 
descriptions can be announced to end users who may then elect to receive the 
announced session or parts thereof. However, the individual modules of the session 
description do not need to be announced together. If the network bandwidth available 
for announcements restricts the size of session descriptions, only the top level base 
module may be announced. In this situation, the link between modules may be, for 
example, a URI to a WWW or an intranet web site or server, an email address, an IP 
multicast address, an FTP address or details of a file or database stored on a local 
computer system from which an interested user can obtain the remaining modules. 

The following session description example illustrates how the above session 
description for base module 420 would be changed if media module 421 were stored on 
a WWW server: 

( # conference track 1 
type=(base) 
id=(420 410) 
info=(title="MM98 Systems and Applications Track") 
source=(owner="Joe Bloggs" email=joe@nowhere.com) 
media=(video=(client=RealPlayerG2) whiteboard=(client=wb)) 
time=(start="09:00 GMT 25/12/98" stop="11:00 GMT 25/12/98") 
options=(osq=424) 
modules=(m=421 location=http://www.announce.org/cgi-bin/module.cgi?id=421 
m=422 m=423 osq=424) 
) 

Session description example 3 



Furthermore, top level modules of a session description may be announced well 
in advance of the actual transmission, at a time where the final details of content are 
unknown, in which case the remaining levels may be made available from pre- 
announced links at a later time. 
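
The retrieval of modules that were not transmitted with the announcement can be sketched in Python as follows. The dictionary layout, the "links" key and the function name resolve are assumptions made for illustration; the fetch argument stands in for whatever mechanism (WWW, FTP, email and so on) is indicated by the link, and is replaced here by a lookup in a local table so that the sketch needs no network access.

```python
def resolve(announced, fetch):
    """Complete a partially announced description.

    announced: dict of module id -> module dict; each module may carry a
               'links' mapping of child id -> location (None if sent inline).
    fetch:     callable taking a location and returning a module dict.
    """
    modules = dict(announced)
    pending = list(announced)
    while pending:
        ident = pending.pop()
        for child, location in modules[ident].get("links", {}).items():
            if child not in modules and location is not None:
                modules[child] = fetch(location)
                pending.append(child)   # the fetched module may itself hold links
    return modules


# Stand-in for a WWW server holding media module 421.
URL = "http://www.announce.org/cgi-bin/module.cgi?id=421"
STORE = {URL: {"type": "media", "links": {}}}
ANNOUNCED = {420: {"type": "base", "links": {421: URL}}}

complete = resolve(ANNOUNCED, STORE.__getitem__)
print(sorted(complete))
```

Because fetched modules are themselves examined for links, lower levels of the description can be published after the top level module has been announced.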

Figure 5 is a schematic diagram of a system for managing media stream 
connections at a terminal of an end user system according to the present invention. 

The session control system 500 is linked to an announcement receiving interface 
510 and one or more multicast-capable multimedia applications 520. The session 
control system 500 and the announcement receiving interface 510 are connected to a 
network interface 530 via which announcements may be received and multicast 
transmissions may be initiated and/or received. 

Announcements received at the network interface 530 are routed to the 
receiving interface 510. The receiving interface 510 decodes each announcement to 
obtain the session description and displays the user oriented information from the one 
or more base modules in a list to the user. The user is able to select a session 
description from the list announcing a session they wish to receive. The selected 
description is passed to the session control system 500 which determines which of the 
user's multimedia applications 520 are required for participation in the described 
session, starts the applications and initiates and provides the necessary media streams 
to the respective applications 520 via a communications manager 550. 

The receiving interface 510 may be linked to other Internet communications 
applications 540 such as a WWW browser or an email client (not shown) which may be 
used to gather further information on the described session based on links provided in 
the session description. Also, where an incomplete set of base and/or media modules of 
a session description is received, the receiving interface 510 attempts to obtain the 
remaining modules using the Internet communications applications prior to passing the 
description on to the session control system 500. 

Figure 6 is a flow chart showing the steps taken by the session control system 
500 upon receipt of a session description. The description is first parsed in step 600 to 
identify client applications for each media module. Once this is done, a second parse is 
carried out in which applications are launched in step 610; that is to say, for each media 
module, the application specified in the client field is started if that application has not 
already been started. The portion of the session description relating to the respective 
media type, i.e. the media module, the base module directly above the media module, 
all other modules attached to that base module and any other options modules that 
apply, is passed to the corresponding application in step 620. Since the media modules 
are marked with appropriate client applications, each application will be able to select 
those media streams that it wants to participate in. The application replies to the 
session control system with a connection request specifying its requirements in the 
form of a list of identifiers of media modules from which streams are to be initiated in 
step 630. The connection request is assembled by the session control system in step 
640 and the system then parses the session description to identify other applications to 
launch in step 645. If a further media type is found, steps 610 to 640 are repeated, 
otherwise the session control system uses the assembled connection requests to form 
a list of media modules. This list is passed, together with a session QoS policy, to the 
communications manager, a system used by the session control system, which
determines according to the QoS policies and available system resources whether each 
connection request is viable. 

The session QoS policy is constructed in two steps: first, the multiple session
QoS policies relevant for all the media modules to be initiated are combined into one
session QoS policy; second, the resulting session QoS policy may be adapted to take
account of (a) user default preferences (defined in a user profile), (b) a user's wish to
determine the policy interactively, and (c) an application's default configuration (defined
in the application profile(s)).
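
The two-step construction can be illustrated with a minimal Python sketch. The description does not state how policies are combined or adapted, so the rules used here are assumptions: a policy is modelled as two sets of media module identifiers (as in an option-sQoS module), combination is a union in which "mandatory" wins over "optional", and each profile or interactive choice is an override applied in order. The names QoSPolicy, combine and adapt are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class QoSPolicy:
    """Assumed model: sets of media module ids, as in an option-sQoS module."""
    mandatory: frozenset = frozenset()
    optional: frozenset = frozenset()


def combine(policies):
    """Step 1 (assumed rule): union all policies; mandatory wins over optional."""
    mandatory = frozenset().union(*(p.mandatory for p in policies))
    optional = frozenset().union(*(p.optional for p in policies)) - mandatory
    return QoSPolicy(mandatory, optional)


def adapt(policy, *overrides):
    """Step 2 (assumed rule): apply overrides in order, later ones winning.

    Each override maps a module id to "mandatory" or "optional" and stands in
    for a user profile, an interactive choice or an application profile.
    """
    mandatory, optional = set(policy.mandatory), set(policy.optional)
    for override in overrides:
        for module_id, level in override.items():
            mandatory.discard(module_id)
            optional.discard(module_id)
            (mandatory if level == "mandatory" else optional).add(module_id)
    return QoSPolicy(frozenset(mandatory), frozenset(optional))


track1 = QoSPolicy(frozenset({421, 422}), frozenset({423}))
other = QoSPolicy(frozenset({423}), frozenset({421}))  # hypothetical second policy
combined = combine([track1, other])
adapted = adapt(combined, {423: "optional"})  # e.g. a user default preference
```

Here the hypothetical second policy makes module 423 mandatory in the combined policy, and the user preference then demotes it to optional again.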

The communications manager responds to the session control system in step 
650 with an indication of the viable media stream connection requests. If necessary, 
the session control system may contact a charging system to initiate accounting for the 
session prior to requesting the communications manager to create the viable media 
stream connections in step 660. 

Once a session starts, each received data stream relating to the session is 
passed to the associated multimedia application in step 670 until the scheduled stream 
time ends in step 680 or the multimedia application requests to the session control 
system that the connection is terminated in step 690, at which point the session 
control system disconnects the connection in step 700. 

Figure 7 is a flow chart showing the QoS management step 650 of Figure 6 in 
greater detail. 

Having received the assembled list of connection requests, the communications 
manager matches each item of this list to a media profile in step 705. A media profile 
defines requirements which must be met for the requested media stream to operate on 
the end user's computer including the minimum network bandwidth needed for 
satisfactory reception of the stream. 

A terminal profile is determined in step 710. The terminal profile defines the
resources which are available at the end user's computer for use by the requested
media streams. This includes available network bandwidth, free memory and disk space
and available hardware such as monitor size, processor speed and free audio and video
capture devices. The media profile of each connection request is compared against the
available system resources defined by the terminal profile in step 720. If the terminal
profile matches or exceeds the media profile, the connection request is declared viable
in step 730 and the terminal profile is decremented accordingly for the remaining
connection requests in step 740. Each connection request is processed until there are
no remaining requests or until the media profile of a request exceeds the terminal
profile. In this situation, the communications manager determines the optimum terminal
profile the user's computer would have if all non-essential applications were not running
in step 750 and whether the computer is capable of fulfilling the media profile in step
760. If the computer is capable of fulfilling the media profile, the communications
manager attempts to free system resources from currently allocated streams or
connection requests which have lower priority or by asking the user to terminate other
non-essential applications running on the computer in step 770. Alternatively, this could
be done by reducing the number of layers received from a layered stream transmission.
If sufficient resources cannot be found, an exception is reported to the user and the
connection request is marked as unviable. If the media stream that cannot be received
is defined as mandatory in a QoS policy for a media session or subsession, all the
connection requests for that media session or subsession are cancelled in step 790. If,
however, the media stream is optional, the communications manager continues
processing further connection requests in step 720. Once all pending connection
requests have been processed, the communications manager reports those that are
viable to the session control system.
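
The core of this procedure (steps 720 to 740 and step 790) can be sketched in Python as follows. This is a simplified illustration under stated assumptions, not the implementation described above: profiles are modelled as dictionaries mapping a resource name to a numeric amount, the resource-freeing steps 750 to 770 are omitted, and the function name, resource name and figures are hypothetical.

```python
def viable_requests(requests, media_profiles, terminal_profile, mandatory):
    """Return the module ids whose streams fit the terminal's resources.

    requests         -- module ids in processing order
    media_profiles   -- module id -> {resource name: amount required}
    terminal_profile -- {resource name: amount available}
    mandatory        -- module ids marked mandatory in the session QoS policy
    """
    available = dict(terminal_profile)  # work on a copy
    viable = []
    for module_id in requests:
        needs = media_profiles[module_id]
        # Step 720: compare the media profile with the remaining resources.
        if all(available.get(res, 0) >= amount for res, amount in needs.items()):
            viable.append(module_id)  # step 730: declare viable
            for res, amount in needs.items():  # step 740: decrement
                available[res] = available.get(res, 0) - amount
        elif module_id in mandatory:
            return []  # step 790: cancel every request for the session
        # An optional stream is simply left out and processing continues.
    return viable


# Hypothetical bandwidth requirements for the three streams of track 1.
profiles = {
    421: {"bandwidth_kbps": 300},
    422: {"bandwidth_kbps": 64},
    423: {"bandwidth_kbps": 200},
}
enough = viable_requests([421, 422, 423], profiles, {"bandwidth_kbps": 400}, {421, 422})
too_little = viable_requests([421, 422, 423], profiles, {"bandwidth_kbps": 350}, {421, 422})
```

With 400 kbit/s available the optional whiteboard stream 423 is dropped and the two mandatory streams remain; with 350 kbit/s the mandatory audio stream 422 cannot be served, so every request is cancelled.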

The processing of a session description will now be described with reference to
Figure 4 and session description example 4 which is the session description generated 
for Track 1 (modules 410 and 420-424 of Figure 4). 



( # overall conference session
type=(base)
id=(410)
info=(title="Multimedia98 Conference")
source=(owner="Joe Bloggs" email=joe@nowhere.com)
media=(video=(client=RealPlayerG2) whiteboard=(client=wb))
time=(start="09:00 GMT 25/12/98" stop="13:00 GMT 25/12/98")
options=(oc=0010)
modules=(b=420 b=430 b=440 oc=411)
)

( # conference track 1
type=(base)
id=(420 410)
info=(title="MM98 Systems and Applications Track")
source=(owner="Joe Bloggs" email=joe@nowhere.com)
media=(video=(client=RealPlayerG2) whiteboard=(client=wb))
time=(start="09:00 GMT 25/12/98" stop="11:00 GMT 25/12/98")
options=(osq=424)
modules=(m=421 m=422 m=423 osq=424)
)

( # video for track 1
type=(media)
id=(421 420)
info=(title="MM98 Systems and Applications Track Video")
source=(owner="Joe Bloggs" email=joe@nowhere.com)
media=(video=(type=live client=RealPlayerG2))
connection=(226.0.0.100/1000)
time=(start="09:00 GMT 25/12/98" stop="11:00 GMT 25/12/98")
)

( # audio for track 1
type=(media)
id=(422 420)
info=(title="MM98 Systems and Applications Track Audio")
source=(owner="Joe Bloggs" email=joe@nowhere.com)
media=(audio=(type=live format=g711))
connection=(226.0.0.101/1001)
time=(start="09:00 GMT 25/12/98" stop="11:00 GMT 25/12/98")
)

( # whiteboard for track 1
type=(media)
id=(423 420)
info=(title="MM98 Systems and Applications Track Whiteboard")
source=(owner="Joe Bloggs" email=joe@nowhere.com)
media=(whiteboard=(client=wb))
connection=(226.0.0.102/1002)
time=(start="09:00 GMT 25/12/98" stop="11:00 GMT 25/12/98")
)

( # session QoS for track 1
type=(option-sQoS)
id=(424 420)
mandatory=(421 422)
optional=(423)
)

Session description example 4 
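
To make the structure of such a description concrete, the following Python sketch reads modules written in the notation of example 4. The grammar is inferred from the example alone and is an assumption: a module is a parenthesised list of name=(...) fields, a field may hold bare values, key=value pairs or nested groups, and "#" starts a comment that runs to the end of the line. The function names are hypothetical and there is no error recovery for malformed input.

```python
import re

# A token is a quoted string, one of ( ) =, or a run of other characters.
TOKEN = re.compile(r'"[^"]*"|[()=]|[^\s()="]+')


def tokenize(text):
    text = re.sub(r"#[^\n]*", "", text)  # drop comments (assumes no '#' in strings)
    return TOKEN.findall(text)


def parse_group(tokens, pos):
    """Parse items up to the matching ')' and return (items, next position).

    An item is either a bare value or a (key, value) pair, where the value is
    a string or, for key=(...), a nested list of items.
    """
    items = []
    while tokens[pos] != ")":
        if pos + 1 < len(tokens) and tokens[pos + 1] == "=":
            key = tokens[pos]
            pos += 2
            if tokens[pos] == "(":
                value, pos = parse_group(tokens, pos + 1)
            else:
                value = tokens[pos].strip('"')
                pos += 1
            items.append((key, value))
        else:
            items.append(tokens[pos].strip('"'))
            pos += 1
    return items, pos + 1


def parse_modules(text):
    """Return one dict per module, mapping each field name to its item list."""
    tokens = tokenize(text)
    modules, pos = [], 0
    while pos < len(tokens):
        if tokens[pos] != "(":
            raise ValueError("expected '(' at token %d" % pos)
        items, pos = parse_group(tokens, pos + 1)
        modules.append(dict(items))
    return modules


sample = """
( # video for track 1
type=(media)
id=(421 420)
media=(video=(type=live client=RealPlayerG2))
connection=(226.0.0.100/1000)
)
"""
video = parse_modules(sample)[0]
```

In the resulting dictionary the id field holds the module's own identifier followed by that of its parent, which is the link the session control system follows when it walks the tree.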

The session control system, having received the above session description, 
processes the tree structure of the session description starting at base module 410. 
The first module encountered is base module 420. As this is not a media module but
does have sub-modules, the session control system continues down this branch to
media module 421.

The media field of the media module 421 already defines the multimedia client 
application required as RealPlayerG2 (a multimedia application of Real Networks Inc) 
thus the session control system ignores it and continues to the next media module. The 
media field of the media module 422 does not have a multimedia client application 
defined, however a format for the audio data is specified. The session control system 
recognises that this particular audio format can be supported by RealPlayerG2 so it 
amends the media field to read client=RealPlayerG2. The next media module 423 has
already defined a client application as wb, so the session control system ignores this
module, and it also ignores the option module 424.
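
This first pass can be sketched in Python as shown below. The sketch assumes that each media module is held as a dictionary whose media entry maps a media type to its attributes, and that the session control system keeps a local table of the formats each installed application supports. The table FORMAT_TO_CLIENT and the function name are hypothetical.

```python
# Hypothetical local registry: audio/video format -> application able to play it.
FORMAT_TO_CLIENT = {"g711": "RealPlayerG2"}


def assign_clients(media_modules, format_to_client=FORMAT_TO_CLIENT):
    """Fill in a client for every media entry that names only a format."""
    for module in media_modules:
        for attrs in module["media"].values():
            if "client" in attrs:
                continue  # already marked, e.g. modules 421 and 423
            client = format_to_client.get(attrs.get("format"))
            if client is not None:
                attrs["client"] = client  # amend the media field in place
    return media_modules


track1 = [
    {"id": 421, "media": {"video": {"type": "live", "client": "RealPlayerG2"}}},
    {"id": 422, "media": {"audio": {"type": "live", "format": "g711"}}},
    {"id": 423, "media": {"whiteboard": {"client": "wb"}}},
]
assign_clients(track1)
```

After the call, module 422 carries client=RealPlayerG2 while modules 421 and 423 are unchanged. A module whose format is not in the table is left without a client.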

The session control system parses the tree structure again in order to launch 
client applications. The first media module 421 specifies that RealPlayerG2 should be 
launched, hence the session control system launches the application on the end user's 
system and keeps a record of this activity. The second media module 422 specifies an 
application that has already been launched and so the session control system ignores it
and continues to the next media module. The media module 423 specifies that wb 
should be launched, so the session control system launches the application and keeps a 
record of this activity. 
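
The second pass amounts to launching each named application once and recording what has been started. A minimal Python sketch follows, assuming the same dictionary representation of media modules as a media type mapped to its attributes; the launch callback stands in for whatever mechanism actually starts an application and is hypothetical.

```python
def launch_clients(media_modules, launch):
    """Start each client application once, in module order; return the record."""
    started = []
    for module in media_modules:
        for attrs in module["media"].values():
            client = attrs.get("client")
            if client and client not in started:
                launch(client)
                started.append(client)  # keep a record of this activity
    return started


track1 = [
    {"id": 421, "media": {"video": {"client": "RealPlayerG2"}}},
    {"id": 422, "media": {"audio": {"client": "RealPlayerG2"}}},
    {"id": 423, "media": {"whiteboard": {"client": "wb"}}},
]
launched = []
record = launch_clients(track1, launched.append)
```

For track 1 the callback is invoked for RealPlayerG2 and then for wb; module 422 triggers no launch because its application is already recorded.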

RealPlayerG2 is passed the media module 421, base module 420 and modules 
422-424. The application processes the media modules given to determine which it can 
handle, and in this case it identifies 421 and 422. Having determined which streams it 
can handle, the application sends a connection request back to the session control 
system requesting connection to the media streams of modules 421 and 422. Similarly, 
wb is passed the media module 423, base module 420, modules 421-422, and the
module 424. The application processes the given modules as described previously, and
requests connection to the media stream of module 423.

The above connection requests are assembled by the session control system
into a list, which is then passed to the communications manager along with the
session QoS policy module 424. The communications manager determines whether
each request is viable according to the steps of Figure 7.

Assuming there are sufficient resources for all the connection requests for 
mandatory media streams, the communications manager passes back a list of viable 
streams to the session control system which then processes the tree again to determine 
the connection data held in the connection field of each media module so it can request 
that the communications manager initiate a connection to the appropriate media stream 
for each of the viable connection requests according to the connection data. The 
session control system then manages the session and its media stream connections as 
is described with reference to steps 670 to 700 of Figure 6. 

Due to the heterogeneity of the Internet and differing capabilities and operating 
environments of end user computer systems, the session control system described has 
been implemented in Java (Java is a Trade Mark of Sun Microsystems Inc.). The
announcement receiving interface, Session Directory, receives the announcements and
passes those selected by the end user to the session control manager implemented as
an application programming interface running as a background process on the end
user's computer.

Whilst the present invention has been described with reference to the Internet 
and multicast transmissions, it will be apparent to the reader that the described modular 
session description and the session control system are applicable to the announcement 
and subsequent management of connections to media streams of a (multi)media session 
using other known transport mechanisms such as unicast.

Furthermore, although mechanisms for encryption, charging and other such 
services have not been explicitly described, it would be apparent to the reader that 
appropriate session descriptions and associated functions within the session control 
system for their processing could be readily implemented according to the mechanism 
required.

CLAIMS



1. A method of announcing a description of a media session, comprising the steps
of:

generating a first base module having a first data structure comprising user
oriented data relevant to the media session;

generating at least one media module having a second data structure comprising
media oriented data necessary for a user to receive a respective media stream of the
media session;

providing a link between the first base module and the at least one media
module; and,

announcing the media session by making at least the first base module available
to potential recipients of the media session,

wherein the link between the first base module and the at least one media
module permits a user to access the at least one media module and subsequently
receive the media stream.

2. A method according to claim 1, further comprising the steps of:

generating a second base module, the second base module containing user
orientated data relating to a sub-session of the media session;

linking the second base module to the first base module; and,
linking said at least one media module to the second base module.

3. A method according to claim 1 or 2, further comprising the steps of:
generating at least one options module having a third data structure comprising
data relating to service level criteria required to participate in the media session; and,
linking the or each options module to a respective base module.



4. A method according to claim 3, in which the data contained in the options 
module relates to a quality of service policy to be used by the media session or a part 
thereof. 

5. A method according to claim 3 or 4, in which the data contained in the options 
module relates to a security system to be used by the media session or a part thereof. 

6. A method according to any of claims 3 to 5, in which the data contained in the 
options module relates to a charging system to be used by the media session or a part 
thereof. 

7. A method according to any preceding claim, wherein one or more media 
module(s) comprise data necessary for a user to receive a layered media stream of a 
respective media session; and said method further comprises the step of linking the or 
each media module to one or more respective options module(s) containing data relating 
to a layered mechanism of the respective layered media stream necessary for a party to 
participate in the layered media stream. 

8. A method according to any preceding claim, in which the data contained in a 
media module includes data necessary for a user to receive or transmit data or both 
receive and transmit for inclusion in the media session. 

9. A method according to any preceding claim, in which the media session is 
announced by transmitting all of the constituent modules of the session description. 

10. A method according to any of claims 1 to 8, in which the media session is
announced by transmitting only some of the constituent modules of the session 
description, with the remaining modules of the session description being subsequently 
accessible by a user using one or more links provided in the modules transmitted. 

11. A method according to claim 10, in which the remaining modules of the session 
description are held on one or more servers and the one or more links to the remaining 
modules are in the form of URI pointers. 

12. A method according to any preceding claim, in which modules of the session 
description contain links to modules which are generated subsequent to the 
announcement. 

13. A computer readable storage medium containing data defining at least a part of a 
description of a media session, the session description comprising:- 

a first base module having a first data structure comprising user oriented data 
relevant to the media session; 

at least one media module having a second data structure comprising media 
oriented data necessary for a user to receive a respective media stream of the media 
session; 

a link between the first base module and the at least one media module; 
wherein the link permits a user to access the at least one media module and 
subsequently receive the media stream. 

Re Item V

Reasoned statement under Rule 66.2(a)(ii) with regard to novelty, inventive step
or industrial applicability; citations and explanations supporting such statement

1. The present vague and broad formulation of the independent method claim 1 (see
Item VIII) fails to meet the requirements of Art. 33(3) PCT, because the subject-
matter does not involve an inventive step.

The presently available closest prior art is given by document D5. 

D5 discloses (the references in parentheses applying to this document): 

A method of announcing a description of a media session (Abstract), comprising
the steps of:

generating a first base module having a first data structure (page 3, section 4.1
"Multicast Announcements") comprising user oriented data relevant to the media
session (page 4, line 28-40: "Thus SDP includes:...");

generating at least one media module having a second data structure comprising
media oriented data necessary for a user to receive a respective media stream of
the media session (page 5, section 5.1: "Media information");

providing a link to at least one base module (page 6, section: "Obtaining Further
information about the session" and page 3, section 4.2: "Email and WWW
Announcements");

and announcing the media session by making at least the first base module
available to potential recipients of the media session (page 2, line 9-17: "Session
directories...");

wherein the link to at least one base module includes more information about a
session (page 6, section: "Obtaining Further information about the session" and
page 3, section 4.2: "Email and WWW Announcements").

The present application differs from this prior art in providing a link between the 
first base module and the at least one media module, wherein the link between 
the first base module and the at least one media module permits a user to access 
the at least one media module and subsequently receive the media stream. 

The objective problem would be to find a method that makes it possible for a user
to subsequently access more than one announced media module.

However, D3 already discloses a solution to this problem in using a link in a form 
of a Universal Resource Locator (URL) syntax. 

Indeed D3 describes (page 7, line 4-7) that sessions on the Multicast Backbone
(Mbone) can be specified using a variant of the Universal Resource Locator (URL)
syntax and that Vosaic supports not only the navigation of the Mbone's
information space, but also real time retrieval of data from arbitrary video
servers.

A person skilled in the art would combine the features of D5 and D3 to arrive at
the desired solution (to find a method that makes it possible for a user to
subsequently access more than one announced media module).

Therefore the essence of the invention is also already known from D5 in
combination with D3.

Consequently, the features of present claim 1 are found by the skilled person in a
most self-evident manner; they are not based on an inventive step, and claim 1 fails
to meet the requirements of Art. 33(3) PCT.

2. Independent claim 13, although phrased as an apparatus claim, is nonetheless a 
simple repetition of the subject-matter of method claim 1 and hence does not 
meet the requirements of the PCT for the same reasons. 

3. The additional features of the dependent claims 2-12 are either directly derivable
from the above cited documents or concern simple embodiments without inventive 
merit in themselves. 

These claims do not, therefore, add inventive matter to the claims upon which 
they are dependent and, as a consequence, do not meet the requirements of 
Articles 33(1) and (3) PCT. 

Re Item VII 

Certain defects in the international application 

1. The independent claims are not in the two-part form required by Rule 6.3(b) PCT, 
with a preamble based on D3 and D5. 

2. The features of the claims are not provided with reference signs placed in 
parentheses (Rule 6.2(b) PCT). 

3. Contrary to the requirements of Rule 5.1(a)(ii) PCT, the relevant background art
disclosed in documents D3 and D5 is not mentioned in the description, nor are 
these documents identified therein. 



Re Item VIII 

Certain observations on the international application

1. The independent Claims 1 and 13 do not meet the requirements of Article 6 PCT
since their subject-matter is not clear for the following reasons: 

1a. The claims mention "A method of announcing a description of a media session...". 

The formulation of the claims 1 and 13 encompasses any form of 
announcements, for example manual announcements, involving no use of 
electronic means. Claims 1 and 13 are therefore not supported by the 
description (Art. 6 PCT) which discloses the announcement of media stream 
connections for a media session over a communications network only (page 1,
line 1-2).

1b. Furthermore it is not understood from whom and from where the media session 
is announced ? 

1c. Further the claims 1 and 13 mention "...generating a first base module having a
first data structure comprising user oriented data relevant to the media session..."

What is meant by first base module ? What form of module ? Where is the first 
base module generated ? 
What kind of data structure ? 

1d. Furthermore the claims 1 and 13 mention "...generating at least one media 

module having a second data structure comprising media oriented data necessary 
for a user to receive a respective media stream of the media session;...". 

What is the difference between the first base module and the at least second 
media module ? Where are they located ? 

What is the difference between the first and second data structure ? 

1e. Furthermore the claims 1 and 13 mention "...providing a link between the first 
base module and at least one media module...". 

It is unclear what is meant by "link" ?

Presumably the Applicant means a (i.e. WWW) link in the form of an URL link 
(Uniform Resource Locator). 

Moreover, it is unclear, what is meant by "between" ? 

Presumably the first base module is located at a different location than the second 
media module (i.e. at a different WWW server ?) (see description page 6, line 2- 
6). 

1f. Furthermore the claims 1 and 13 mention "...announcing the media session by 
making at least the first base module available to potential recipients of the 
media session,...". 

How is the first base module made available ? 

1g. Furthermore the claims 1 and 13 mention "wherein the link between the first 
module and the at least one media module permits a user to access the at least 
one media module and subsequently receive the media stream...". 

From whom and from where will be the media stream received ? 

2. The formulations "...by making at least the first module available to potential 
recipients of the media session..." and "wherein the link between the first module 
and the at least one media module permits a user to access the at least one 
media module and subsequently receive the media stream..." in claims 1 and 13 
seek to define the alleged invention by referring to features which concern the
effect which is desired to be achieved and thereby rendering the definition of the 
subject-matter of said claims unclear (Article 6 PCT). See also PCT Guidelines 
Chapter III-4.7. 



Form PCT/Separate Sheet/409 (Sheet 5) (EPO-April 1997) 



